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The California Environmental Data Center is concerned with increasing the 
uses and benefits of the Landsat program. Landsat imagery and the informa- 
tion which can be obtained from Landsat can play Important roles in the in- 
formation systems of many public agencies, but this is not now widely recognized. 
VHiile Landsat imagery has been used in a number of large scale projects, its 
use in smaller projects, especially those closely tied to agency's ongoing 
and day-to-day operations, has been much less frequent. For most such agencies, 

Landsat data can play only a secondary role as a data gathering system; they 

% 

rely for most of their data on more conventional sources. This current inade- 
quacy in the present system of coordination and availability does not allow 
this valuable data to be used to its full potential. At the federal level, 

NASA is concerned with the same issue. 

A central theme In work undertaken by these two agencies is efficient 
gathering of environmental data for management of our natural resources, and 
the transfer of necessary technologies to appropriate levels of both govern- 
ment and private enterprise. 

A joint effort designed to address the needs of both agencies has resulted 
in the initiation of The’ California Integrated Remote Sensing System (CIRSS) 
Project. This project is sponsored by NASA and involves three studies: 

1. Vertical Integration 

2. State User Needs 

3. Landsat Demonstration Projects 

Environmental Systems Research Institute (ESRI) of Redlands, California, 
is a research and consulting organization that centers on the analysis and 
interpretation of geographic information for planning and decision making. 
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ESRI recently participated in a pilot project with NASA Ames Research 
Center (NASA/ARC) which was designed to integrate Landsat imagery with 
an automated geo-based information system (GBIS) . While ESRI currently 
uses Landsat photographic products as collateral information in the creation 
of GBIS data bases by photo interpretation, the two systems have not been 
operationally integrated and used together • If Landsat data is to be an 
effective part of public agencies and private sector information systems, 
Landsat data must be integrated into the existing data systems. Moreover, 
it must be integrated in ways which are as compatible as possible with •the 
existing system, requiring the least change on the part of the user’s 
existing procedures. 

The Vertical Data Integration Study under CIRSS is designed to accont- 
plish these purposes. It begins with user needs and goes from these to the 
data available from Landsat. It is designed to produce useful products to 
the agencies and to the private sector, in formats convenient for their 
operational use. 

Vertical data integration under optimum conditions, high technology 
operations or operations requiring large scale bulk processing would be 
performed at a central location by higher organizational levels. Lower 
technology efforts that emphasize ancillary data and intense field work or 
review would be performed at lower levels. The exact mix of function and 
organizational level would likely vary with data sets and organizational 
context. This is seen as a key concept in the CIRSS project. That is, 
a strategy for the creation of data sets' and accompanying analysis techniques 
that allow data sets to be meaningfully applied at various jurisdictional 
levels, corresponding scales, and resolutions. 
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It is important to recognize that vertical integration requires as much 
emphasis on the existing data bases as it does on the Landsat data; inte- 
gration requires clear understanding of where the data are going and how to 
enter the data. This integration may employ either **bottom-up" or **top- 
down" techniques depending upon the particular data sets, their character- 
istics, institutional frameworks, and available processing technologies. 
Examples of "bottom-up” versus "top-down" integration include: 

1. Land Ownership Data 

The original data collection is done by the county or city 

% 

agency where the data is often used as individual records. Data 
flow in the upward direction would generally be accompanied by either 
geographical or categorical aggregation. 

2. Satellite Data 

The initial data collection is done through hi^ altitude sensors 
employing ultra-high technology with highly centralized processing and 
analysis functions. The data is passed downward at the original reso- 
lution and information content. Spatial disaggregation, in a. sense, 
might occur with increasingly accurate geometric corrections. Categorical 
and spatial data integrity could be achieved by modeling relationships 
with locally collected data of higher resolution. 

This paper examines two prototypical procedures for Integrating Landsat 
and traditional geo-based data files. In the first case, geo-based grid 
files were transferred to the Electromagnetic Systems Laboratory (ESL) Inter- 
active Digital Image Manipulation System '(IDIMS) at NASA/Ames. In the second 
case, image files from IDIHS were transferred to 9 traditional GBIS created 
and operated by ESRI. Before discussing the integration procedures used for 
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this pilot project, this paper will address the data sources, automation 
procedures, and creation of the grid cell data base at ESRI, and \;111 
also address the data sources and classification scheme of the Landsat 
image at NASA/ ARC. The integration procedures are then discussed along 
with problems and solutions for this integration study. Conclusions of 
this pilot study are stated in the final section of this report. 



STEPS IN THE CREATION OF THE GEO- BASED INFORMATION SYSTEM 
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I. Geographical Definition of the Study Area * 

ESRI chose as a pilot study area, a portion of the Redlands, 

California 7.5 minute U.S.G.S. quadrangle. The study area was 7072 
hectares (approximately twenty-four square miles), occupying the central 
75% of the quadrangle. The north-west coordinate of the study area was 
located at the UTM coordinates of 479,000 E and 3,773,000 N, and an 
80 meter square grid pattern was aligned with the UTM map grid. The 
area included the central business district, residential areas, outlying 
urban fringes, citrus orchards, chaparral-covered foothills, and valley 
bottom floodplains. For such a small geographic area, the data types per 
varlable^were quite diverse. 

II. Mapping of the Data 
A. Data Sources 

The following data sources were used as collateral information: 

1. Low altitude, false color, infrared imagery at a scale of 
1:24,000. 

2 . Existing maps . 

a. 1974 Land Use Maps that were pirepared at ESRI for Southern 
California Edison. 

b. Census tracts, 
c • Zoning • 

d • General Plan • 
e* Assessor's Parcel, 
f. U*S»G.S« Topographic. 
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3 • Existing Reports . 

a. Soil Conservation Service report on prime agricultural 
lands • 

B. Creating the 1978 Land Use Map 

All the data variables composing the 1978 Land Use Map were 
reformatted to a consistent scale of 1:24,000; the imagery was used 
to update and verify the existing maps. A manuscript map was drafted 
on stable base mylar which contained all the data variables. There- 
fore, the manuscript map consisted of sub-areas (polygons), eacjh with 
an associated code string which numerically describes each variable. 
The product of this cartographic effort is called an Integrated 
Terrain Unit Map (ITUM)^. For geographic referencing tic marks, 
representing known latitude and longitude locations, were drafted on 
the manuscript map. There were four tics on the map, one at each 
corner . 


1 An ITUM is generated by superimposing several sets of data and 

drafting a map of the polygons created. Each polygon then represents 
multiple data. 
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III. Digitizing 
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Once the map has been drafted and prepared, the map Is digitized, 
which is the process of capturing the polygon configurations in a series 
of cartesion coordinates which preserves the spatial integrity of the 
mapped data. The manuscript map was taped to the digitizing machine 
surface and a cross-hair cursor passed over the line segments composing 
* the polygons. On the 1978 Land Use Map, there were approximately 2,300 
polygons for the entire 7.5 minute quadrangle. 

* 

IV. Automation 

After the digitized file was obtained, a series of ESRI user-oriented 
software was used to transform the data, which has been referenced to 
known latitude and longitude coordinates, into an error-free file which 
is an automated copy of the original manuscript map.^ The codes were then 
associated with their respective polygons, broken into distinct variables, 
and a uniform grid superimposed atop the polygon files for each variable. 

A square grid cell of 80 meters was chosen to minimize the effects of 
resampling the Landsat file. This procedure of transforming each variable 
into a grid cell structure allows for the overlay of individual variables 
which represent the same geographic area. This procedure allows for the 
application of varied and complex modeling procedures for the analysis of 
natural hazards, constraints, resources, and for the spatial definition of 

3 

land capabilities and suitabilities. 

2 The actual computer programs and techniques used to produce the error- 
free file will not be discussed in this paper. 

3 It should be noted that this mapping effort and study was initially done 
to aid the development of a growth management study for the City of 
Redlands * 
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The following is a list of the variables in the GBIS which was 


created by 

ESRI: 


1) 

1974 Land Use 


2) 

1978 Land Use 


3) 

Enumeration Districts 


4) 

Status of Undeveloped Land 


5) 

Zoning of Undeveloped Land 


6) 

Existing Areas of the General 

Plan 

7) 

Proposed Areas of the General 

Plan 

8) 

Fire Hazard Areas 


9) 

Flood Prone Areas 


10) 

Agricultural Soil Potential 


11) 

Soil Depth 



(See Figures 1, 2, 3, 4 for polygon maps and grid cell maps of the 1974 
and 1978 Land Use data as automated by ESRI.) 
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Figure 1 1974 Land Use 
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V. Conversion to Image Format 

The next step involved transporting the grid data to NASA/ARC for 
use on the ESL Interactive Digital Image Manipulation System (IDIMS) . 

The ESRI data sets were prepared, using a format believed to be compatible 
with IDIMS transfer function tapes for multiple four-band Images. This 
method was not immediately successful. The tape was again processed at 
NASA/ARC, this time read as groups of four single-band images. This 
effort was also unsuccessful. Multiple record blocking, which should 
have solved the problem had not yet been implemented on the IDIMS system 
at NASA/ARC / 

A portion of this problem was traced to a spurious short record at 
the beginning of file one. By specifying single-band processing with 
labels for the first file, it was possible to read the data as 100 x 113 
vertical strips from an implicit AOO x 113 image. (See Figure 5 for a 
graphical illustration of how the files were read as four strips of 
width 100 with starting line sample locations of (1,1) (1,101) (1,201), 
and (1,301).) 

Succeeding files were similarly read, without label processing. Even- 
tually, all the ESRI files were placed on the IDIMS transfer tapes for 
future operations. A more efficient format has been designed for future 
data transfers. 


4 Problems encountered in this step are characteristic of the IDIMS 
and may or may not apply to all image processing systems. 
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ESRI Grid Cell Files* as read by IDIMS 
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PROCEDURES IN PREPARATION AND CLASSIFICATION OF THE LANDSAT DATA 
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I . Obtain Landsat Data . 

Personnel at NASA Ames Research Center obtained a raw Landsat data 
tape which was generated by Jet Propulsion Laboratory (JPL) , using scenes 
from August 1976. The data had bulk geometric correction and had been 
spatially and spectrally mosaicked into 1° by 1^ quadrangles. The data 
was then classified at NASA/ARC using unsupervised clustering on the 
ILIAC IV computer. This classification was done as part of the California 
Department of Forestry (CDF) project for the entire state with the «tate 
being divided into ecozones.^ The San Bernardino West quadrangle (con- 
taining Redlands) has three defined ecozones which are Desert, Los Angeles 
Basin, and South Border. For these ecozones there was a total of 167 
clusters. 

II. Define the Study Boundary 

This step was performed by using the IDIMS and defining a window of 
the San Bernardino West quadrangle, which included the study area. The 
resulting window was a subset of the entire quadrangle and was 250 lines 
(rows) of 200 samples (columns) . The parcel size was 80 meters nominal 
and map grid geometry was Lambert Conformal Conic with standard parallels 
at 33° and 45°, rotated such that grid north was 11° east of due north 
along 117°west longitude. Windows of both the classified and -raw data 
(4 band) were created. 


5 Specific issues and procedures used in this classification project 
are not addressed in this -paper. 
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III. Registration of Data Set to UTM Grid . 


i 4 


Ci 


A. Digitizing Control Points . 

Within the study area, eleven control points were located. Using 
the IDIMS display curser command, the line-sample locations for each 
of these control points were displayed and recorded for later use. 

With the aid of an orthophoto quadrangle of the Redlands area, these 
control points were also located on the 7.5 minute quadrangle. Using 
the IDIMS geographic entry system (GES) , the control points were 

digitized into a central geoblock for further use in IDIMS. The 

* 

central points were typically street intersections, corners of cleared 
areas, intersections of airport runways, parking lots, and stream 
junctions. Image position estimates were done to the nearest whole 
pixel; actual locations were digitized on the map. 

B. Creation of Transformation Records . 

In this study, three transformation records were generated for 
registration purposes. Using IDIM’s functions TRANSFORM and ALLCOORD 
utility programs, the GES control points were registered to the 
Landsat Control Points (Tl), the GES control points were converted 
to UTM raw and column equivalents and registered to the UTM grid 
(T2), and the UTM grid equivalents of the GES control points were 
registered to the Landsat control points (T3) . Transformation matrix 
Tl is the most critical as it relates line sample values to latitude- 
longitude coordinates. For the registration process a third degree 
polynomial function was employed which, for the worst case, yielded 
a control point value that was one pixel width in error. 
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C. Geometric Correction and Registration . 

Using IDIMS function REGISTER, and the above transformation 
records, the window from the classified image was rotated and re- 
sampled using a nearest neighbor algorithm. After this procedure 
the study area (except three pixels) was contained within the 
Los Angeles Basin ecozone, and Included 40 unsupervised clusters 
or classes. It contained 113 lines of 100 samples each with 80 
meter pixels aligned with the UTM coordinate system. 
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IV, Conversion of Landsat Data to GBIS Format . 

Personnel at NASA/ARC created a magnetic tape which was compatible with 
the ESRI computer facility. * Software was written which converted the Landsat 
data to a GBIS file format. There were no significant problems with this data 
transfer. This Landsat data was later Integrated into the ESRI GBIS. This 
will be discussed in the next section. 
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FT,OW CHART OF VERTICAL INTEGRATION PROCEDURES 
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INTEGRATION OF DATA SETS 
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Successful Integration is eventually dependent upon the ability of the 
users to satisfactorily meet data needs. This achievement is dependent upon 
the complex interaction of a specific problem and local data characteristics 
of involved data systems. In a "perfect world", user data needs would shape 
data characteristics, and in turn determine system characteristics. In 
reality, data sets often either predate computer systems or are shaped by 
system capabilities. Fiscal and institutional constraints place additional 
limits on how the data sets may be Integrated. % 

Three basic options for integration of data sets are: 

1. Conversion of a geographic based information system (GBIS) file to 
Landsat image format, with subsequent processing in an image-processing 
environment . 

2. Conversion of Landsat Images to GBIS format, with subsequent processing 
on the GBIS. 

3. Treatment of both a GBIS and Landsat data set in a hybrid system or 
through a least common denominator compromise. 

Each option has its own set of problems as a result of conventional and 
Landsat data characteristics and the design limitations of the data system. 

Only the first two of these options were explored in this pilot study. 

A flow chart of these procedures is Illustrated in Figure 6. 
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I. Integration of GBIS Files with Landsat Data . 

This project . successfully integrated a grid cell data base with an 
image system format.^ 

Image systems, such as IDIMS, while being able to efficiently carry 
out image-related functions are often limited in their ability to mani- 
pulate external data contained in a geographic data base. With IDIMS, 
two options are available for external data entry: 


1. Digitization of map materials using the geographic entry system 
which is a direct component of the image processing system.* A 
possible solution to this option would be the conversion of 
"outside users" data to a format that the ESL digitizing equip- 
ment uses. But, internal data structures used by ESL, while 
enhancing system efficiency, are very difficult to replicate from 


externally generated data. This option would require considerable. 




labor for data bases of any large content. 





2. Conversion of GBIS files to pseudo-images. This was the option 
that was used for this integration effort. This option required 
that GBIS data files be converted to pseudo-images for use on 
IDIMS. As images are most effectively handled as byte data with 
values of 0-255, data values or code identifiers must be within 
this range. Nominal or labeled data sets must be converted to 
an index. 


6 Procedures or problems encountered are related to the IDIMS and may or 
may not be* representative of all image processing systems. 
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Image to image manipulation functions are extensive although their 
immediate application to geoprocessing problems arc not clear. For 
example, some assessor's parcel data were digitized and automated with 
block number and parcel number Identifiers. With IDIMS, the parcel 
data were overlaid with the Landsat data to determine the utility of 
this form of operation. The objective of this operation was to assign 
Landsat data values to the parcel. The usefulness of this type of data 
related to the parcel has obvious merits especially for county data 

systems. The resulting composite was only marginally useful, as parcel 

% 

codes were not unique outside a particular block. Some conclusions can 
be made however. The coverage of the parcel data should have been full 
which implies that streets and other "non-parcel" data should be included . 
in the data base. Parcel numbers or identifiers should be unique, even if 
this requires a certain redundancy in the data. 

Also in the internal workshop at NASA/ARC, Landsat classified data 
was observed and compared with other geo-based files, such as 1974 land use. 
Contingency tables were generated, comparing both the land use file with 
classified Landsat data from the California Division of Forestry (CDF) pro- 
ject, which show the number of occurrences of codes between the two files 
and also calculating the expected value of these codes. Two contingency 
tables are included in this report: 

1) Table One compares the 1974 land use file with the classified 
Landsat data. 

2) Table Two compares the 1978 land use file with the classified 
Landsat data. 
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Table !• 197A Land Use vs. Classified Landsat Data 
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Using both of these tables, one could Identify the changes in code 
frequencies between the files Included in the contingency tables. 

Each classified Landsat cluster was named for the major constituent 
class from the 1978 land use file. The 197A land use file was also observed 
when change was apparent in the area. Comparing the 197A and 1978 land 
use files was complicated somewhat by the fact that the minimum polygon 
resolution was smaller in the 1978 land use file than the 197A file. If 
no single class was clearly dominant for the Landsat cluster, a compound 
name (i.e. orchard/brush) was used. From this study it was apparent that 
wide varieties of association existed between the geo-based files and 
the Landsat data. For example, one Landsat cluster labeled "brush** fell 
94% into **orchard** in the land use file. At the other extreme, a cluster 
labeled **urban** in the classified file fell 86% into orchard of the land 
use file. This illustrates the problem of applying cluster labels developed 
for a large area (the Los Angeles Basin Ecozone) into some smaller consti- 
tuent area. Table 3 illustrates a comparison of Landsat cluster values 
before and after the integration process and clearly demonstrates a> finer 
breakdown of classification which resulted from the integration of the 
geo-based land use files with the Landsat data. 

In another experiment, image "masks** were created, using a Boolean 
logic function. All 1974 land use pixels with agricultural uses were con- 
verted to a byte value of 127 and all other uses to a zero value. Using 
an ADD function, this mask was overlaid on the Landsat classified image. 

The result of this was an image that was background for all non-agricul- 
tural uses, showing Landsat classified values for agricultural areas only. 
The resultant image was used in a crude method of change detection between 
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COMPARISON OF LANDSAT CLUSTER 
VALUES BEFORE AND AFTER INTEGRATION PROCESS 


Land sat 

Cluster CDF Group 


Group Classification 
After Integration 


88 

Grass 

90 

Barren 

92 

Barren 

93 

Barren 

103 

Grass 

106 

Brush 

107 

Brush 

108 

Brush 

109 

Brush 

110 

Brush 

111 

Urban 

112 

Brush 

113 

Brush 

114 

Urban 

115 

Urban 

116 

Urban 

117 

Urban 

118 

Urban 

119 

Urban 

120 

Urban 

121 

Urban 

122 

Grass 

123 

Grass 

124 

Grass 


Grass/Golf Courses & Schools 
Barren/Sand and Gravel 
Barren/Sand and Gravel 
Barren/ Sand and Gravel 
Grass/Golf Courses 
Orchards 

Steep Undeveloped 

Orchards and Brush Including Abandoned 
Orchards 

Brush % 

Orchard/Transitional 

Orchard 

Brush/Orchard 

Or char d /Res id en t ial 

Ur ban/Or chard 

Res id exits / Orchard 

Commercial 

Residential 

Residential/Mixed Urban 
Commercial 
Residential 
Commercial 

Residential/Transitional or Grass 

Pavement 

Pavement 
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1974 and 1976. However, there were a large number of "false positives” 
due to the smaller effective minimum mapping unit employed in the Landsat 
classified image. 

Using another IDIMS function, Polaroid positives as well as reproducible 
negatives were created using a DICOMED film recorder for the 1974 land use, 
1976 Landsat classes, and the 1976 Landsat masked only for agricultural 
areas. Comparable, but not identical, color schemes were used to facili- 
tate visual image comparison. A notable feature is the visual comparison 
problem caused by the relatively fine texture of the Landsat data. 
Reproductiom of these films have been done, but are not Included in this 
report . 

As a final experiment, terrain data derived by JPL from DMA/NCIC 
digital terrain files were also registered to the study area grid. As 
the terrain data was already aligned with the Landsat data by JPL, no 
additional control points were needed and the existing transformation 
equivalencies were applied. Several problems were seen in the resultant 
images. Nearest neighbor resampling is probably not appropriate for 
terrain data. As an example, registered terrain data did not yield the. 
same slope value for many pixels, due to the nearest-neighbor resampling 
method changing the values of adjacent pixels used for slope calculations. 
Similarly, nearest neighbor resampling distorted the JPL aspect data for 
the same reason. Additionally, the aspect data from JPL was oriented 
toward Lambert grid north, not true north. When resampled, it was still 
relative to Lambert grid north, and therefore of little apparent value in 
a UTM geo-based file or the registered and resampled Landsat file. Aspect 
data pointed from true north would have resampled correctly. 
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Performing data Integration in the IDIMS environment has major 
advantages and disadvantages. Landsat unsupervised cluster labelling 
is greatly facilitated by immediate, interactive access of registered, 
collateral files. The interactive display allows for rapid evaluation 
of various cluster grouping and mapping strategies to maximize final 

n.'“ 

image/product information content. Processing times' fast, and allow for 
rapid evaluation of various modeling constructs and sensitivity analyses 
in both quantitative and qualitative frameworks. Unfortunately, IDIMS’ 

ability to perform complex modeling is severely constrained by. a cumbersome 

% 

method of Boolean relational modeling; while an experienced user might 
be able to successfully concentrate various image-manipulation functions 
required for such moedling, the occasional user would likely be overwhelmed. 
Another serious problem is the inability of the system to accept external 
polygon files as input or produce polygon files as output for other systems. 
IDIMS is also largely incapable of handling external files in the case where 
image data values may simply serve as pointers to such a file, i.e., an 
image of soil series names/codes that reference an extensive seriesrspecif ic 
table of soil characteristics* 
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Personnel at ESRI also successfully Integrated the Landsat Image Into 
a G6IS data base. Initially It Is seen that the Landsat data could serve 
two functions. First, the Landsat data could be used as collateral data 
to verify the photo interpretation used in establishing the GBIS data base. 
Secondly, the Landsat data, once accurately classified, could be used in the 
modeling processes of land capability and land suitability studies. At the 
time of this publication, the first of these options had not fully been 

explored, but considerable work has been done with using the Landsat data 

% 

for grid cell modeling. For example, the Landsat data was used in the GBIS 
data base to generate grid cell area calculations for various types of land 
use between 1974, 1976, and 1978. Figure 7 displays a grid cell map that 
was generated with existing ESRI software of the Landsat data. 

Although image systems are generally similar in their treatment of data 
sets, GBIS vary considerably not only in their internal data representation, 
but also in the way the data sets are manipulated. Several basic data base 
concepts are in common use: 

1. Regular grid cell, either square or rectangular (ESRI data). 

2. Triangular or other non-rectangular grid cell. 

3. Polygon systems (either closed loop or intersection-chain networks). 

4. Mixed systems. 

Each system will have its own set of technical problems when dealing with 
integration. The following subset of problems will be common to nearly all 
systems: 

A. Effective Spatial Resolution : Most GBIS data bases employ a data 

configuration other than an 80 meter square pixel* A mechanism (such as 
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Figure 7 - Classified Landsat Data 
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in IDIMS) for resampling, aggregating, dr other routines is necessary 
to assign classified Landsat values to GDIS geographic units. Users 
will often say that Landsat ’s nominal 80 meter resolution is too coarse 
for their applications; yet classified image products may be criticized 
as being too detailed. This effect is related to the more discipline- 
related problem of spectral versus spatial classification and resolution. 

One solution might be in an algorithm designed to "group” classified data 
into geographically contiguous areas to simulate the minimum mapping unit 
criteria applied by photo interpreters. ^ 

B. Geographic Referencing ; Two issues are important: 

1. Projection: GBIS will generally use either state plane grid 

coordinates, universal transverse mercator (UTM) grid coordi- 
nates, or latitude- longitude coordinates. Geometrically cor- 
rected Landsat data will be in whatever projection was used to 
generate the transformation equation. In the CDF project a 
Lambert Conformal Conic projection was. used. While this pro- 
jection may be useful for a statewide mosaic, it is not useful 
for small areas. Projection changes may be accomplished utili- 
zing either the geometric properties of the projection or a 
polynomial transformation, depending upon the situation. 

2. Geometric Precision: When using Landsat data with other data 

sets, it is extremely important to know the geographic location 
of any given pixel. The degree of precision to which this 

can be accomplished has both practical (time, cost, etc.) as well 
as theoretical (integrating scanner) limitations. 
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1. Regular Grid Cell: If grid cell orientation and projections 

are the same, integration is simple. However, misalignment 
problems would be very difficult to solve in a grid system. 
Software for performing grid data base rotation and resampling 
is not included in most grid-oriented systems. Solutions 
range from image rotation and rescaling as in image-based 
systems to manual grid cell encoding from hard copy images. 

2. Non-Rect angular Grid Cell: This would likely require an' even 

more complex interpolation routine. Other considerations are 
similar to the regular grid cell discussion above. 

3. Polygon Systems (closed loop): Integration would likely require 

a point- in-polygon interpolation routine. Landsat data might 
have to be converted to polygon data sets, a non- trivial operation 
Manual re-digitizing might be useful as a technique, but quite 
cumbersome. 

A. Polygon. Systems (Intersection-Chain Networks): Thi’s is probably 

the most difficult interface without some system hybridization 
and shares the same problems with closed loop. 

5. Mixed Systems: A number of polygon systems are in fact mixed 

systems, with integrated polygon- to-gr id conversion routines and 
grid cell manipulation capabilities. Such systems would seem to 
offer the grea test interfacing potential with a minimum of new 
software design* (The ESRI software package is a good example; 
there are others, including the USGS Geographic Information 
Retrieval and Analysis System currently under development.) 
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Performing Landsat integration in the traditional GBIS environment also 
has its strengths and weaknesses. Processing speeds are slower compared 
to image systems, as most grid-based systems are not designed to manipulate 
very large data files. ^ They are designed to manipulate sample data files 
which is typical of Landsat. Modeling capabilities in conventional systems 
are generally more user-oriented, and display and model analysis rarely 
occur in an interactive mode. Traditional outputs Include printer maps, 
electrostatics, and DICOMED images. 

Landsat *8 relatively small cell size will often require aggregatd.an 
when integration occurs. Hence, informational detail will be lost in 
systems using much larger, e.g. 40 acre grid cells. Another potential 
problem in the integration process is that most GBIS are Incapable of per- 
forming precise registration between two grid data sets with. dlslmllar map 
projections, cell sizes, and grid orientations. 


7 These data files are considered to be 1000 x 1000 or greater. 
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Resource planning/management responsibilities and activities at all levels 
of government have greatly increased the demand and use of environmental data. 
Governments are increasingly turning to computerized geo-based Information 
systems to handle these large volumes of data. Dealing with the costs of these 
data sets and data systems is becoming increasingly difficult as governments 
face rising costs and limited revenues. 

One potential answer to these problems involves vertical data integration, 
defined as the creation of data sets and accompanying analysis techniques that 

V 

may be used by various levels of government. Vertical data integration should 
allow cost-sharing of data sets and analysis techniques, minimum duplication, 
and especially with high-technology data sources like Landsat, considerable 
potential for realizing scale economies and improved technology transfer. 

Based on our experiences of these two integration efforts, the evaluation 
of any Landsat-GBIS integration process requires: 

1. Investigation of the conventional data sources. 

2. Investigation of the characteristics of Landsat data products. 

3. Characteristics of data systems and analysis techniques used in the 
integration process. 

Landsat data has some known advantages as a large area data source, such as 

1) digital multi-band foinnat which allows for low-cost spectral classifica- 
tion; 

2) uniformity within the data set and' compatibility between data sets; and 

3) timelines and currency which can provide a. significant temporal components c 
These advantages are offset by problems relating to the Landsat product itself - 
such as format, registrar ion/ geometric correction, image quality and resolution. 
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as well as with classifications of the digital image which may be project or 
discipline-specific. Landsat may be interfaced with conventional data through 
a number of operating modes, including: 1) manual photo interpretation with 

subsequent manual or semi-automatic entry of resulting graphic data into a 
GBIS; 2) utilization of digital products in image-based system; or 3) utiliza- 
tion of digital products or derived data bases with non-image systems. Evalua- 
tion of these modes of data integration requires the investigation of several 
geographic data handling problems, such as polygon-grid conversion, scale and 

projection change, image manipulation, and hybrid systems, within the context 

« 

of data system limitations. 

These issues will be investigated within the context of actual county 
government operations in California with emphasis placed on integrating Landsat 
derived data into existing data bases used to analyze environmental problems. 
County governments represent a major potential user community with a pressing 
requirement for environmental and resources data; additionally, their GBIS 
activities may be taken as representative of similar activities undertaken by 
both regional and local governments. 

Evaluation of any proposal for the vertical integration of Landsat data into 
the California governmental or private usage pattern requires addressing and 
resolving a number of technical and institutional issues which include: 

a) The degree of centralization of digital image preprocessing activities. 

b) The degree of centralization of interpretation (digital analysis) 
activities. 

c) An evaluation of the utility of various product types at various levels 
of user sophistication. 

d) Determination of the flexibility versus efficiency trade-offs in genera- 
ting digital products for use at various governmental levels. 

e) Determination of the limiting effects of spatial resolution and geo- 
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referencing of Landsat data sets and near-future satellite sensors in 
urban areas with particular application to change detection potential. 

f) Evaluation of the feasibility of a centralized, stage government 
operated Landsat processing utility function. 

A test site of San Bernardino County has been selected where county, state, and 
federal agencies will participate in a NASA sponsored project of vertical inte- 
gration to study these issues. 

The objectives of the project may be grouped as follows: 

a) Describe and evaluate the parameters of conventional data sources, 

Landsat products, and both operational and near-operational GBIS systems 
that influence the technical feasibility of Landsat integration. Such 
analysis will likely concentrate on issues relating to geographic 
referencing and classification techniques. 

b) Investigate the data requirements of environmental planning and resource 
management activities of county government with emphasis on those disci- 
plinary areas where Landsat has the greatest application potential. 

c) Select a set of test problems with emphasis on development and demon- 
stration of techniques for integrating Landsat data into locally operated 
GBIS. First year work will evaluate the utility of classified Landsat 
digital data provided by higher levels of government . 

d) Demonstrate project findings through the generation of agency products 
from GBIS systems utilizing integrated Landsat data. 

e) Document the development of data parameters, system descriptions, appli- 
cation selection, and test results, including costs and benefits. 

f) Describe implications of test results as potential for future technology 
transfer, related applications, and future data system development. 
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